Using Machine Learning to Identify Intonational Segments
Authors
Abstract
The intonational phrase is hypothesized to represent a meaningful unit of analysis in spoken language interpretation. We present results on the identification of intonational phrase boundaries from acoustic features using classification and regression trees (CART). Our training and test data are taken from the Boston Directions Corpus (task-oriented monologue) and the HUB-IV Broadcast News database (monologue and multi-party). Our goal is two-fold: (1) to provide intonational phrase segmentation as a front end for an ASR engine, and (2) to infer topic structure from acoustic-prosodic features. These efforts are aimed at improving the ease and flexibility of retrieving and browsing speech documents from a large audio database.
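As a rough illustration of the kind of decision a classification tree makes, the sketch below finds the single best threshold split on one acoustic feature by minimizing Gini impurity, which is the basic step CART repeats recursively. It is not the authors' system: the feature (pause duration after each word), the toy values, and the function names `gini` and `best_split` are all assumptions made for this example.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 means pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def best_split(values, labels):
    """Return (threshold, weighted_gini) for the best binary split on one feature.

    Candidate thresholds are midpoints between consecutive distinct sorted values.
    If no split improves on the unsplit impurity, the threshold is None.
    """
    pairs = sorted(zip(values, labels))
    best = (None, gini(labels))
    for i in range(1, len(pairs)):
        lo, hi = pairs[i - 1][0], pairs[i][0]
        if lo == hi:
            continue  # no threshold can separate identical values
        threshold = (lo + hi) / 2
        left = [y for x, y in pairs if x <= threshold]
        right = [y for x, y in pairs if x > threshold]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(pairs)
        if score < best[1]:
            best = (threshold, score)
    return best


# Hypothetical toy data: pause duration (seconds) after each word, and
# whether an intonational phrase boundary follows (1) or not (0).
pause_s = [0.02, 0.05, 0.03, 0.40, 0.04, 0.55, 0.06, 0.35]
is_boundary = [0, 0, 0, 1, 0, 1, 0, 1]

threshold, impurity = best_split(pause_s, is_boundary)
print(f"split at pause > {threshold:.3f} s, weighted Gini = {impurity:.3f}")
```

A full CART learner would apply `best_split` across several features, recurse on each resulting subset, and prune the tree; a real system would also use measured acoustic features rather than invented values.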
Similar resources
Machine Learning Models for Housing Prices Forecasting using Registration Data
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...
Intelligent application for Heart disease detection using Hybrid Optimization algorithm
Predicting heart disease is very important because it is one of the leading causes of death around the world. Moreover, predicting heart disease at an early stage plays a major role in treatment and recovery and reduces the costs and side effects of diagnosis. Machine learning algorithms are able to identify an effective pattern for diagnosis and treatment of the disease and ident...
Acoustic Classification of Focus: On the Web and in the Lab
We present a new methodological approach which combines both naturally-occurring speech “harvested” on the web and speech data elicited in the laboratory. This proof-of-concept study examines the phenomenon of focus sensitivity in English, in which the interpretation of particular grammatical constructions (e.g. the comparative) is sensitive to the location of prosodic prominence. Machine learn...
An analysis of prosodic information for the recognition of dialogue acts in a multimodal corpus in Mexican Spanish
This paper presents empirical results of an analysis on the role of prosody in the recognition of dialogue acts and utterance mood in a practical dialogue corpus in Mexican Spanish. The work is configured as a series of machine-learning experimental conditions in which models are created by using intonational and other data as predictors and dialogue act tagging data as targets. We show that ut...
Automatically Derived Discourse Segmentation Algorithms Based on Acoustic-Prosodic Features
We describe an investigation aimed at furthering the understanding of how speakers communicate discourse structural information using intonational features. We used the read and spontaneous speech of two speakers from the Boston Directions Corpus (BDC) to automatically identify elements of discourse structure based on intonational features. Unlike previous acoustic-prosodic analyses of discours...